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Under  regularity  conditions,  the  infinite  set  of  moments  (when  these 
exist)  characterise  a  distribution;  thus  it  is  appealing  to  use  the  first 
s  sample  moments,  or  functions  of  these,  as  test  statistics  for  goodness 
of  fit.  For  example,  the  statistics  b^  and  b2  have  long  been  used  to 
test  for  normality,  using  the  first  four  sample  moments.  Gurland  and 
Dahiya  (1970)  and  Dahiya  and  Gurland  (1972),  in  articles  hereafter  referred 
to  as  GDI  and  GD2,  developed  this  approach  in  a  systematic  way  by  comparing 
functions  of  sample  moments  to  the  corresponding  functions  of  population 
moments;  these  functions  were  chosen  to  be  linear  in  any  unknown  parameters 
of  the  distributions. 


In  this  article  we  explore  some  theoretical  aspects  of  the  Gurland- 
Dahiya  method,  and  also  give  some  details  of  the  techniques  when  applied  to 

tests  for  the  normal,  exponential,  and  Gamma  distributions.  The  basic  test 

~  2 
statistic  of  Gurland  and  Dahiya,  below,  has  an  asymptotic  xt 

distribution  with  t*s-q,  when  q  parameters  must  be  estimated  and  s 

A 

sample  moments  are  used.  We  show  how  Q  can  be  decomposed  into  t 
*  2 

components  each  with  an  asymptotic  distribution,  and  such  that 


Qt  *  Q  i  ♦  .  Thus  as  a  new  sample  moment  is  added,  the  new  test  statistic 

contains  the  previous  statistic  plus  a  new  asymptotically  independent  term. 

a 

The  statistic  Qt  thus  breaks  down  into  components  in  a  manner  similar 
to  the  Neyman  (1937)  statistic,  or  to  EDF  statistics  (Durbin  and  Knott,  (1972) 
Stephens  (1974));  each  new  component  might  be  expected  to  test  for  a  new 
departure  from  the  tested  distribution.  When  applied  to  a  test  of  normality, 
with  s  *  3  or  4,  the  statistics  b^  and  b^  result  naturally  from  the 
method. 

Also,  we  show  that  the  test  statistic  for  exponentiality ,  using  s  *  2 
moments  with  only  the  scale  parameter  unknown  (so  q  *  1)  is  equivalent  to 
several  other  statistics  already  proposed  by  various  authors,  often  from 
quite  different  points  of  view. 

2.  THE  TEST  STATISTICS 

2.1  As  far  as  is  practicable,  we  use  the  notation  of  GDI  GD2.  The 
symbol  ' ,  in  the  matrix  algebra  below,  will  denote  the  transpose  of  a  column 
vector  or  of  a  matrix.  In  standard  notation,  the  same  symbol  will  be  used 

to  refer  to  population  moments  or  sample  moments  (uj  and  m!  respectively) 
about  the  origin.  In  context,  there  should  be  no  confusion. 

2.2  Suppose  the  null  hypothesis  is 

Hrt:  a  random  sample  X  ,X  ,  ...,X  comes  from  the  continuous 
u  1  l  n 

density  f(x;9), 

where  0'  »  (0^, —  ,0^)  is  a  row  vector  of  q  unknown  parameters.  We  assume 
for  simplicity  that  all  parameters  in  the  distribution  are  unknown,  but  the 


treatment  may  easily  be  modified  when  only  a  subset  of  parameters  in  f(x;9) 
is  unknown. 

2.3  Suppose  i  *  1 . s  are  functions  of  the  population  moments 

Uj>  j  *  l....,s,  chosen  so  that  ^  is  linear  in  0j,...,0  .  Thus, 
writing  C'  *  ),  we  have  ;  *  W0  where  W  is  an  s  x  q  matrix 

of  known  constants. 

2.4  Let  Ik  be  the  same  vector  as  but  with  sample  moments 

m!^  *  (Xr)1/n  replacing  the  population  moments  y!^  ,  and  let 

h’  =  (h,,...,hs);  h  is  a  consistent  estimator  of  ?  . 

2.5  Consider  the  vector  statistic  /n  (h-C)  »  which  measures  the  difference 
between  h  and  C  .  Its  covariance  matrix  is  I  =  JGJ'  ,  where  G  is  the 

s  x  s  symmetric  matrix  with  entries  »  and  J  is  tlie 

s  x  s  Jacobian  matrix  with  entries  *  (6ci/6y!)  ,  (i, j*l , . . .  ,s)  . 

Gurland  and  Dahiya  show  that,  if  the  (2s) -th  moment  of  f(x;8)  exists, 

^  (h-O  is  asymptotically  normal  with  mean  =  0  and  covariance  matrix  £  ; 
then  the  asymptotic  distribution  of 

CD  q  =  n(h-C)»  Z'1  (h-C) 


Further,  if  £  is  replaced  by  £  ,  a  consistent  estimator  of  £  , 
the  asymptotic  distribution  of 


M«a*g«ncrafCTwywrigKgTqr^»TanqrongtCTiar^riqr^r^iroiTqn^rCT7v^OTo%imyTi>TOngLf^  -gtrwuvuw»wr>ruw'rtnruw«^-iru . 
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is  also  xs  •  In  general,  E  contains  some  of  the  unknown  parameters  in  9  ; 

A  "*  ** 

E  is  obtained  by  replacing  9  in  E  by  6  ,  say,  where  9  is  a  consistent 
estimator  of  9  . 

2.6  Statistics  Q  and  Q*  will  be  large  when  the  sample  vector  h  is  far 
from  C  ,  but,  as  they  stand,  they  cannot  be  used  for  testing  the  fit  of  the 
sample  to  f(x;9)  because  they  contain  the  unknown  parameters  in  8  ;  in  Q 
these  arise  in  C  and  E  .  However,  when  9  is  used  to  replace  9  in  E  , 

A 

the  resulting  matrix  E  will  no  longer  be  a  function  of  9  ;  thus  9  enters 

A 

Q*  only  through  C  .  Gurland  and  Dahiya  propose  that  ,  the  minimum 
value  of  Q*  as  9  varies,  can  be  used  as  a  goodness  of  fit  statistic; 

A  A 

furthermore,  if  the  minimum  occurs  for  9  =  9,  then  9  is  an  estimate  of  9 

A  A 

analogous  to  other  "minimum  chi  square"  estimators.  The  values  of  9  and  Qt 
are  found  as  follows.  Define  matrices 


=  w»  r 


C3) 


-1  w 

and 

A  A  i 

Z  =  W  E  w 

W'  E"1 

and 

.  A 

R  >  WZ_1  W»  E~ 

(Is-R) 

and 

A  A  |  A 

A  *  E  (I  -R) 

where  E  is  obtained  as  described  above,  and  where  I  is  the  s  *  s 

A  A  A 

identity  matrix.  Note  that  Z  and  Z  are  q  x  q  ;  R,  R,  A  and  A  are 
s  *  s  and  R  and  R  are  iderapotent.  Then  (GD1^GD2^ 


(4) 


9  -  Z"1  W  E^h  ,  and 

A  A 

*  nh'Ah  .  Define  also 


Q  *  nh'Ah  . 


Note  that,  to  calculate  ,  it  is  not  required  to  know  9  . 

2.7  Notation  In  the  following  sections  we  shall  examine  the  properties 

/v 

of  Qt  and  Qt  .  New  matrices  and  vectors  will  be  defined  as  required;  for 
convenience,  the  most  important  of  these  can  be  listed  together  as  follows: 


matrix  T 
matrix  K 
matrix  L 


Z  *  TT'  where  T  is  s  *  s  lower  triangular  (definition  of  T) . 

Z  =  KK*  where  K  is  q  *  q  lower  triangular  (definition  of  K) . 

L  *  T  *WZ  ^W'(T’)  *  ,  a  symmetric  and  idemp otent  s  *  s  matrix. 

S  =  Is*L  ,  a  symmetric  and  idempotent  s  *  s  matrix. 


P  is  the  matrix  which  diagonalises  L  :  thus  L  =  PA^P'  .  The  leading  sub 


matrix  of  A.  of  order  q  is  I  and  other  entries  of  A,  are  zero. 

1  q  1 

Vector  g  of  length  s  is  g  =  T  *h  ,  with  components  g1,g2,...,gg  . 


*  P'g  «  P'T*1! 

When  Z  is  used  instead  of  Z  the  matrices  become  T,  K,  L,  S,  P  and  the 


Vector  C  of  length  s  is  ?  *  P'g  *  P'T~Ah  ,  with  components 


A  A  A  A  A 


vectors  become  g  and  £  . 


3.  THE  STATISTIC  Qt 

3.1  Gurland  and  Dahiya  (GDI)  showed  that,  on  HQ  ,  the  distributions  of  Q 

~  2 

and  the  statistic  Qt  were  each  asymptotically  xt  *  with  t  *  s  -  q  .  We 
now  extend  these  results  to  show  how  the  statistics  can  be  decomposed  into 
components. 

Theorem  1.  There  exists  a  set  of  components  ,  i  *  1,2 .  such  that 

Qj  *  C.  ,  where  t  «  s  •  q  ;  asymptotically,  the  Cj  are  independent, 

2  ~ 
each  with  a  Xj  distribution.  There  also  exists  a  set  C.  ,  such  that 

^  C.  ;  asymptotical ly  the  are  independent  each  with  a 


distribution. 
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JV  ¥\7WT*~&\  fr  VV", 


Thus  ^t+1  "  *  ct*l  and  Qt+1  *  Qt  +  ct+l  :  that  is>  a  new 

asymptotically  independent  component  is  added  to  Qt  to  obtain  »  anc^  to 

A  A 

Qt  to  obtain  . 

For  ease  of  notation,  the  proof  below  is  given  for  only;  the  proof 

for  Qt  is  very  similar.  In  what  follows,  distribution  theory  will  mean 
asymptotic  distribution  theory.  In  this  section,  when  s+1  sample  moments 
are  used  in  part  2  of  the  proof,  the  vectors  and  matrices  will  carry  an 
asterisk,  for  example,  h*,  I*,  T*  . 

3. 2  Proof  of  Theorem  1.  Part  1 

Decomposition  of  .  Suppose  I  »  TT’  where  T  is  a  lower  triangular 
s  x  s  matrix.  Then  Z  *  W'l  *W  may  be  written  Z  =  KK'  where  K  is  a  lower 
triangular  q  x  q  matrix.  (Note,  however,  that  K  is  not  W' (T')  1)  . 

Let  h  *  Tg  ,  so  that  g  =  T_1h  ;  then  Qt  *  nh'Ah  becomes 


*  rig’T'1(Is-R)Tg 


Hence 


=  ng' (Ig-L)g 


where 


L  *  T'1WZ*1W'E‘lT 
=  T*1WZ"1W' (T*)' 


a,1 

& 

pb 

jSffi 

& 

I 

1 


& 

$ 

& 


m 

n 

m 


It  is  easily  shown  that  L  is  symmetric  and  idempotent;  thus  matrix 
S  *  Is-L  is  also  symmetric  and  indempotent. 


Comment.  In  many  situations  (e.g.  for  tests  of  normality  and  exponentiality, 
to  be  discussed  in  Sections  4  and  5  below)  L  and  S  are  also  diagonal, 
with  diagonal  entries  1  or  0  .  Define  A^  and  A^  to  be 


I  0 

0  0 

q 

and  A.  = 

0  0 

z 

0  i_ 

S-q 

where  the  0  matrices  have  the  necessary  dimensions  to  make  both  A^  and 
of  dimensions  s  *  s  .  Then  L  will  equal  A^  and  S  will  equal  A^  , 
and,  from  (5),  we  have 


(6) 


ng'Sg  • 


It  at  once  follows  that,  if  t  *  s  -  q  , 


(7) 


t 

n  E  C.  , 

j-1  3 


where  component 
follows  because 

T'^CT'V  *  I  ■ 

S  *  PAjP'  where 


C.  »  ng 

1  Vj 


j  *  l,...,t  .  The  independence  of  C 


1 


g^+j  is  normal  N(0,1)  and  the  covariance  of  g  is 
When  S  *  I-L  is  not  itself  diagonal,  it  may  be  written 


P  is  orthogonal.  Then  let  £  =  P'g  ■  P'T  h  and  we  have 


(L  «  n£'A,£  -  n  Z  £?  • 


(8) 


■1  -.■’■I.v 


Thus  again  Qt  is  the  sum  of  components  =  n^q+j  ’  j  3  l,...,t  ;  each 

C  is  N(0,1)  ,  and  the  covariance  matrix  is  P'T  *Z(T  *)  '  P  *  I  ,  so  that 

2 

these  components  are  independent  and  distributed.  Corresponding  results 

A  s  *2 

hold  for  Qt  ,  which  becomes  n  ,  using  the  notation  of  Section  2.7 

Proof  of  Theorem  1.  Part  2 


To  complete  Theorem  1,  we  must  show  that  the  same  set  of  components 

/N  A 

arises  in  Qt+^  (°r  Qt  +  j)  as  i-n  Qt  (or  Qt)  with  the  addition  of  one  new 
term.  For  this  purpose  (pursuing  the  general  case  when  L  and  S  are  not 
diagonal)  we  show  how  matrix  P  is  constructed  so  that  S  =  PA2P’  or 
equivalently  L  *  PA^P'  . 

Construction  of  matrix  P.  The  s  x  s  orthogonal  matrix  P  is  constructed 
as  follows:  let  P^  *  T^WfK’)'1  ,  an  s  *  q  matrix,  and  let  P2  be  an 
s  x  (s-q)  matrix,  such  that  P^  *  0  and  P^P2  *  I  ;  then  construct 


P  *  [P.:P,] 


Lemma  1:  P  is  an  orthogonal  matrix. 


Proof-  To  show  that  P  is  orthogonal  we  need  PJP^  *  1^  .  We  have 


K'XW' (T’)’1  T'^CK*)'1 


*  K'StZ'1  W(K’)-1 
=  K^ZCK’)'1 

*  K"1KK’ (K1)'1 


■  I  as  required. 


.  *  ,  *  V  >  ^ 


Lenina  2:  LP?  =  u 


Proof.  Since  P^  *  0  we  obtain 


pyr” Lw (K  * ) ~ 1  *  o  ; 


P^T_1W  =  0  , 


(11)  LP2  =  T'1WZ‘1WCT*)'1P2  «  T'1«Z'1(P^1W)' 

Lemma  3:  P  is  the  matrix  which  diagonalises  L  ;  L  =  PAjP’  . 


Proof.  We  have 


PiLPl 


K-iw,  (T')"1T'1WZ"1W'  (T')‘1T"1W(K') 


*  K’1W,E"1WZ"1»I,E"1W(K,)~1 

=  k'”1zz”1zck,)_1 

=  K’lKK' (K')*1 


By  Lemma  2,  LP-  =  0  ,  so 


P’LP  =•  L[P.  P-] 
P, 


i1,  0 


0  0 


41  • 


Thus  the  idempotent  matrix  L  equals  PAjP'  . 

To  complete  the  proof  a  similar  procedure  is  now  followed  in  s  +  1 

dimensions.  We  have  P*  *  (T*)’1W*(K*')"1  ;  and,  as  above,  P*'P*  =  I  and 

P*'L*P?  *  I  •  Since  £  is  the  leading  submatrix  of  £*  ,  it  follows  that 

1  I  q 


where  t'  ,  v*  are  row  vectors  of  length  s  ,  0  is  a  column  vector  of 
length  s  ,  and  c  and  d  are  scalars.  Also 


where  w'  is  a  row  vector  of  length  q  .  Therefore 


where  a'  »  v’W  ♦  dw'  .  Define  matrix  V  ,  of  dimensions  (s*l)  by  (s-q) 


V  i 


where  O'  is  a  row  vector  of  length  s-q  . 

Lemma  4.  V  is  orthogonal  to  P*  . 

(K*')*1  *  P£T1W(1C*,)“1  =  0  ,  from  (10)  . 

Therefore,  if  we  define 

P2:y 

P2  * 

O’  :r 

where  y  is  a  column  vector  of  length  s  and  r  is  a  scalar  such  that 
the  last  column  simply  completes  P*  ,  we  have 

P*  =  [P*:P*]  . 


Proof.  V’  P*  *  V' 


T_1W 


a' 


Now 


Qt  =  ng'(I-L)g  =  ng'CI-PAjP^g  =  n(P’g)  ’  (I-A^P'g 

s  2  -1 
*  n  Z  £7  where  5  =  P'T  n  ; 

i=q+l  1 


that  is. 


•  «  pyr_1h 


Therefore  Q  =  nh' (T’ )_1P2P'T*1h  . 


For  Qt+1  we  have 


T  1  O  h 


(T*)_1h* 


v'  d  h  .  v'h  *  dh  , 
s  +  1  s+1 


Hence 


...  ■  P*'(T*)-1h* 


?'2  o|p_1h 


y'  r  v'h  ♦  dh 
1  s+ 


P^T_1h 


y'T_1h  ♦  r(v'h  ♦  dhg+1) 


The  first  s-q  components  of  agree  with  those  of  Qt  .  This 

result,  together  with  Part  1  completes  the  proof  of  Theorem  1  for  Qt  . 

4.  THE  TEST  FOR  NORMALITY 


4.1  We  illustrate  the  above  decomposition  with  the  test  statistic  for 
normality  given  in  GDI.  This  statistic  is  constructed  as  in  Section  2  above, 
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with  S'  *  (y^,  log  y2,  y3,  log(y4/3)}  .  Here  U2>  y^,  U4  are  moments  about 
the  mean,  and  are,  of  course,  functions  of  y ^ y^  and  y^  .  Then 
?  •  W6*  where 


W 


1 

o' 

0 

1 

V 

0 

0 

and  9  *  * 

e* 

.0 

2, 

2 

with  »  y^  ,  02  *  y2  ,  and  @2  *  log  y2  .  Vector  h  is  then  given  by 

h'  *  {m^,  log  m2>  m^,  log(m4/3)}  ,  where  m2>  m^,  m4  are  sample  central 
moments.  Then  (GDI,  equation  4.10) 


0 

2 


0 

0 


0  > 
4 


0  662  0 
4  0  32/3 


In  this  example  Z  depends  on  02  but  not  on  9^  (the  component  elements 
J  and  G  of  Z  do  depend  on  9,  but  81  drops  out  in  the  final  calculation; 
GDI  gives  the  separate  matrice 
decomposition  Z  =  TT'  gives 


GDI  gives  the  separate  matrices  J  and  G  as  though  9^  were  zero).  The 


T  * 


,®2 

0 

0 

0 


0 

2 


0 

0 

'69: 


0 

0 

0 


2^2  0  2,(273)  j 


and  T 


-1 


0 

0 

0 


0 

_1_ 

/2 

0 

-im 


'69; 


o 

o 

o 

178 


4.2  Comments.  Component  C^  is  equivalent  to  *  m‘/m2  ’  C,  is 

equivalent  to  b2  *  m^/m*  »  ^  and  ^2  are  t*'e  we^“*cnown  measures  of 

skewness  and  kurtosis,  often  proposed  for  testing  normality.  An  interesting 

'"‘1/2 

aspect  of  the  method  is  that  it  reveals  two  functions  of  moments,  C^  and 
'"1/2 

,  which  are  asymptotically  N(0,1)  and  independent,  and  which,  when 

A 

squared,  form  the  components  of  the  overall  statistic  Q2  .  GDI  gives  the 

A  A 

final  statistic  Q2  (there  called  Q  in  equation  4.15)  but  not  the 

A  A 

decomposition.  The  statistic  9  in  this  case  becomes  9  =  (mj,  log  m2)  •  . 

4.3  Extension  to  any  value  of  s.  These  results  for  s  =  4  may  be  extended 
to  more  general  s  .  For  the  calculation  of  E  set  9^*0;  then 

P2r  *  u2r  a  9^(2r) !/(2rr! )  and  odd  moments  are  zero.  GDI  gives  J  in  two 
parts,  the  Jacobian  of  the  transformation  from  central  moments  to  origin 
moments,  and  Jacobian  J2  of  the  transformation  from  £  to  central  moments; 

J  *  J_J,  .  It  may  be  shown  that  (J,)..  *  1  ,  (J.).  ■  -iyj  ,  i  >  1  ; 

(J  )  .  ■  0  otherwise;  and  (J  ) . .  *  1  if  i  is  odd,  (J_) . .  *  1/y.  if  i  is 

A  1  J  *  H  *m  11  1 

even,  and  (Jj) . .  *  0  otherwise.  Let  £  have  entries  a..  ;  i,j  =  l,...,s: 

J 


°u  *  e2;  °ii  *  0  •  1  »  1  :  °1,  '  0  ■  j  5  1 


„  .  XiiiiL  !t± L.x  tJ 

lj  l!’!  (ill)! 


both  even 


(!i)  XiiiiL 

2  (ijl)! 


Oj^  *  0  otherwise  . 


2i !  j  l 

C~) !  •' 


i,j  both  odd 


1 

M 

w? 

m 

m 

w»y 

*lv,y 


oik 


I 

>»W.* 

U 

(ft* 


KG 
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1,*  ««T»  |j‘  («'  *1* 


(  4*4  |  a  |  >  ik, 


Then  let  T  be  the  s  *  s  matrix  with  entries  t  „  defined  by 


;11  "\/e2  *  *lj  *  °  *  J  >  1  *  tU  -  0  .  i  >  1 


m  2n‘1/2 

t2m,2n  “  (n5  ^n-ljjl/2  *  n  -  m  » 


gm+i/2  C2m4-1) »  l 

e2m+l , 2n+l  '  2»-"(m.n)!  f  ( 2nH.! )  1 } 1/2  ’ 


t.  .  *  0  otherwise. 
i.J 


It  may  be  shown  that  I  *  TT*  ,  making  use  of  the  identities 


22n(i+i)! 


C2H-2j)I 


(2n) ! (i-n) ! (j-n) !  (2i)!(2j)! 


y  22n't'1(m*k*l)'.  (2mt-2k-»-2) ! 

n_0  (m-n) ! (k-n) ! (2n+l) !  ~  (2k+l) ! (2m+l) ! 


Let  U  =  T  1  and  let  U  have  entries  .  Then 


,  u  .  *  0  ,  j  >  0  ;  u.  =  0  ,  i  >  0 


'O' 


,  ,.m+njiu  vm-i 
u2m,2n  '  (-1J  Q 


,  n  <  m 


p 

m 


,*■1, 

»** 


*.»V/ 

i 


mm 


I 


tern 

Kfo: 

(Si 


m\ 

m 


SB®: 

Bsife 


(a-n) ! 2m"n  Qn*l/2  (2n*l) ! 


n 


m 


u 


2m+l,2n+l 


u. .  »  0  otherwise. 
IJ 


This  may  be  checked  by  direct  calculation  of  UT  .  Now  W  is  the  s  *  2 
matrix 


• 

1 

0 

1] 

0 

1 

FI 

0 

0 

0 

0 

2 

and  column  1  of  T"*W  is 

0 

0 

0 

• 

0 

• 

• 

• 

• 

»• 

•J 

.  •  - 

Column  2  of  T-1W  has  zeros  in  the 
positions  the  entries  are 


odd  positions  and  in  the  even 


(-l)™  E  (-l)n  n (“) 

m'1 _ n^l _ 

-,m-l/2 


1/2 


(T-1W) 


2m,  2 


l/(2^2)  when  m  *  1  ,  and  0  if  m  /  1 


Thus  T-1W  is  the  s  *  2  matrix 


We  now  compute  S  *  T  *(I-R)T  =  I-T  *RT 


Using  the  above  expression  for  R  we  find 


jSj* 


S  «  I  -  T"l[W:o]T 


i  -  [t'1w:o]t 


o  o 
o  o 


A  _  1  A_1  _ 1  ~ 
S  is  again  diagonal.  Thus  if  g  =  T  h  where  T  is  T  with  9, 

4 

replacing  02  ,  we  have  the  general  result 


m 

!!; 

w; 

m 

iOp 


Qt  *  n  r  g‘ 

i-3  1 


Now  vector  h  has  components 


hi  *  "1 


1 

ftm 


h2r.l  "  m2r+l 
h2r  ’ 


Because  of  the  structure  of  T"  ,  odd  g^^  involve  only  odd  h.  ,  and 
even  g^  involve  only  even  hi  .  We  have,  for  r  an  integer. 


*2r  s  ^  (u2r,2n)h2n 

l  .  nr-*-nfr>  ^  r-P  .  f2nn!  •> 
*  n*l  (1)  n)  2r'1/2  1  8  C2n)!  m2n^ 


/(2r‘J)  r  -n  , 

(-i)r-df£-  2  (-n'Oiogcf^-,  V 


In  particular, 


i°g(A)  .  and  g2  =  |  log  , 


as  above; 


A  /5  .  ,9  V2,  .  A2 

*6  *  T  l0gC5~)  and  8C 


5  9  b4  2 

^[log(|~|)]  . 

b2 


2  3 

where  b2  *  n^/mj  as  be^ore*  an<*  ^4  =  \/m2  * 


The  odd  gi  ate,  with  r  an  integer, 


J2r«-1  3  S.  (u2r+l,2n+l^h2n+l 
n*l 


5  ,  ,,r+n  /(2m)!  ®2n*l 

“  n-l C2n+1)!  §rl/2  2r  n(r-n) ! 


(-l)r  /C2r^ii-  l  (-l)n 


(2n+l) !  e2+1/2(r-n)! 
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In  particular,  using  9^  3  as  before,  we  have 


>v  ^  nij  ^2  1  2  m3 

83  3  -^~Tn  and  g3  3  6  bi  *  where  bi 3  ^72  :  also 


85 


— —  (b  -  10b, )  ,  where  *  -57=-  . 

^20  3  1  3  m3/ 


Thus 


a  ,  j  ,  b.  ,  c  9b.  2 

Q4  -  n(I  bj  ♦  |(log  -1)  ♦  ^  (b 3  -  10bl)2  ♦  £  (log  -4)  }  , 

DD2 


with,  on  Hq  ,  a  x4  distribution  asymptotically.  Extension  to  higher 


order 


Qt 


is  obvious. 


5.  THE  TEST  FOR  EXPONENTIALITY 

A 

5.1  In  this  section  we  give  the  general  decomposition  of  Qt  for  the  test 
for  the  exponential  distribution  F(x)  -  1  -  exp(-x/9)  ,  x  >  0  .  Following 
GD2  we  define 


U2  U3 


r'  =  fn'  —  _  J  1  ■ 

*»  u'  ’  u'  u'  J  ’ 

^1  w2  Ms-1 


then  C  *  W0  ,  where  W*  *  [l,2,...,s]  . 


Vector  h  is  Z,  with  m!^  replacing  y!^  •  The  covariance  matrix 
Z  of  /n  h  is  then  known  to  have  entries 
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We  now  write  E  *  TT*  where  the  entries  of  T  are 


hj  -  9iq:p  *  1  ^ 


i  <  j 


Then  U  =  T  has  entries 


1  (-I)1*3  fi-K  .  ^  . 
S’  j  *  1  - 3 


i  <  j 


Further  results  are:  (T  *W) '  ■  0,  0,  . . . ,  0)  ;  Z  *  1/9^  ;  R  has  first 

column  entries  l,2,...,s  and  all  other  entries  zero;  T  *R  has  1/9  in 
row  1,  column  1  and  zeros  elsewhere.  Finally  S  becomes  diagonal,  as  for 


the  normal  case: 


0  0 
-°  X*-L 


and  further  decomposition  of  S  is  unnecessary.  Let  g.  be  the  i-th 
component  of  g  =  T  h  :  using  8  as  the  estimate  of  9  in  T  ,  we  have. 


with  m^  =  1  : 


.  i  i  '-'Wj'-b  _i_ 

9  J"1  >  m’j-l 


,  i  =  1,2,.. 


In  particular,  writing  for  mj  for  ease  of  notation,  and  using  9  *  s^ 


we  have 
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1  '  *2  s  -  1  *  77  : 

2si 


i  !i  «.  S3  .  A 

'  s2  3Sls2  *  g4 


S1S2  4S1S3 


Note  that  g2  is  equivalent  to 


S2  n  2  n  2 

-4*»l  (XJVC  z  Xj 

sl  r-1  r=l 


~  t  A2 

The  test  statistic  ()  is  then  n  I.  ,  j,  . 

xt  i=«l  *l+i 

5.2  Comment .  Connection  with  a  test  for  uniformity.  Suppose  the  sample 

j 

values  X.  are  placed  in  sequence  on  a  line,  and  let  v,..  =  Z.  ,  X.  : 

i  r  (j)  i=l  i 

suppose  the  v^  are  divided  by  the  sum  of  the  Xi  ,  z  =  X.^  ,  to 

give  values  u^  »  v^/z  ;  it  is  well-known  that,  on  HQ  ,  the 
U(j)*  ^  3  l»*--»n-l  are  the  order  statistics  of  a  sample  of  size  n-1  from 
the  uniform  distribution  on  [0,1]  .  The  values  X^/z  are  the  spacings  d^ 

between  the  u^  >  and  a  test  for  uniformity,  proposed  by  Greenwood,  is 

n  2  2  A 

based  on  G  «  Z._j  d.  .  G  is  then  s2/(ns^)  and  so  component  g2  above  is 

equivalent  to  G  .  There  has  recently  been  a  revival  of  interest  in  G  , 

and  percentage  points  for  finite  n  have  been  given  by  Burrows  (1979) , 

Currie  (1981)  and  Stephens  (1981).  These  points  show  that  /n"  g_,  converges 

only  very  slowly  to  its  asymptotic  N(0,!)  distribution. 


6.  TEST  FOR  THE  GAMMA  DISTRIBUTION 

6.1  The  two  previous  illustrations,  tests  for  normality  and  exponentiality, 
were  interesting  because  the  general  form  could  be  produced  for  any  s  ,  and 


also  because  for  small  s  the  test  components  reduced  to  already  well-known 


test  statistics.  In  DG2  the  Qt  statistic  is  also  discussed  for  the 


Gamma 


distribution.  This  situation  is  important  because  although  many  tests  exist 
(EDF  tests,  for  example)  for  populations  in  which  the  unknown  parameters  are 
location  or  scale,  far  fewer  tests  are  available  for  the  case,  as  here,  where 
a  shape  parameter  is  unknown.  Unfortunately  it  does  not  appear  to  be  straight¬ 
forward  to  give  very  general  results,  but  we  now  develop  the  components  of 

A  /V 

Qj  and  Q2  . 

The  null  hypothesis  is  the  population  for  X  is 


f(x)  *  FKo  p  x“'le'X/S  :  *>0  ■■  >  o  . 


Thus  q  3  2  .  In  DG2  new  parameters  are  defined:  0^.  *  aB  ,  92  =  B 
(we  have  here  changed  the  DG2  notation) .  Then  the  vector  C  is  given  by 


<2  *3 

C  =  <  — —  — 

5  V  <x  ’  <2 . 


where  is  the  j-th  population  cumulant  =  (j-OIaB"1  .  Then  if 

0*  *  WC  ^ave  S  *  with 


1  0  0 


0  1  2 


Vector  h  is  vector  ?  with  sample  curaulants  k^  replacing  population 
cumulants  .  Considerable  algebra  gives,  for  s  *  3  , 


A  *  *- 


2a  10+8a  48+50a+6a 


Then 


0 


0 


1  Y 

2  4y 


0 

6 


and 


where  y2  =  2(l+a)  and  S2  =  6(l+a)(2+a) 


Finally 


-R  is 


T  -R  -  * 

s  3a+ 10 


0  0  0 
0-4  2 

0  -6a-28  3a+14 


and 


S  *  T-1(I  -R)T 


_1 _ 

3a+10 


0 

0 

0 


0 

4 

{12 (2+a)  } 1/2 


0 

{12(2-ks)> 

3(2+a) 


1  *  »  ul  «.l  «.t  *.1  l.<  ».a‘t.»<l.«‘>  t  i  .‘I  .n 


S  is  iderapotent  of  rank  1  as  required,  but,  in  contrast  to  the  normal 
and  exponential  cases,  S  is  not  diagonal.  Decomposition  into  S  *  PAP' 
gives 


p  s  — l -  o  {6«-3a} 

/3a+10 

0  -2 


{6+a} 


and  A„  =  0  for  all  i,j  except  =  1  . 

«  A  A  A  •  A  A 

Then  using  3.2.8  we  have  £  *  P'T  h  ,  and  £  =  P*T  h  ,  where  P  ,  T 
are  derived  from  P  ,  T  by  replacing  parameters  a  and  B  by  consistent 

A  A 

estimators  a  and  8  (GD2  use  moment  estimators).  Because  only  A^  is 
not  zero  in  matrix  A  , 


2  A 

Qt  »  nC'AC  *  nC3  ,  and  similarly  *  n£3 


where 


need  therefore  find  only  5  3  •  This  becomes 


£,  =  -ka/(10+3a)}1/2w'h 
J  B 


w'  =  "  ~  1/2’  ~  1/2^ 
(2+2ct)  '  (2+2ct) L/i 


k2  k3 

h  3  fki»  r-  ■ 
1  2 


Ktf? 

E'lVs 

m 


m 

p 

m 


m 

i 


.(  '.f  -i  >||.  i<.  |l.  iVlVl'i  |*i  #*•  fJI’**#  *-♦  *«* 


fctH 

[&!«!•! 


-  k  -  k 

1  R  2 

Moment  estimators  of  a  and  B  are  a  *  r—  and  B  =  r—  .  Thus 

K2  K1 


m 

m 

m 


2*2a  «  2(k-+k^)/k-  ,  and  — *-r  *  - — j-  ; 

1  10+3a  (10k2>3kp 


P 

kk$ 


k^kjkj^kp 


/2  k2/z(10k2+3kp1/iJ(k2+kp 


nk^kjk^k*) 

2k^(10k2+3kJ)(k2+k^) 


to! 


In  terms  of  sample  moments  this  is,  writing  s^  for  m!  as  in  Section  5, 


s  ns i  222 

li  3  2  3  2  ^1^3  *  ^2^1  ” 

1  2s2(s2-sp-S(10s2-7s‘)  1  ■*  1 


Note  that  SjS  ♦  s2s1 
moment . 


2  2 

2s.  *  m'm  "  2m  ,  where  m.  is  the  i-th  central 

fc  1  «J  +*  X 


6.2  Coimnent.  The  calculations  above  can  be  considerably  simplified  by 


m 

iii 


&& 


kl 


*2  K3  % 

starting  with  C  3  fcr-  ,  —  .  - - 

1  2  s-1 


)  ,  that  is,  by  missing  out  the  first 


component  in  the  above  treatment.  This  has  the  effect  of  reducing  the  matrix 


and  vector  dimensions  by  1.  For  s  =  4  we  now  pursue  this  approach. 


nl 

m 

I 

m 
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6.3  Test  fQT  Gamma  distribution,  s=4.  Short  method.  Let 


V  * 


^2  ^3  S 

*r  k2  ^3 


(8,26,36)  *  8 [1 ,2 ,3] 


Then  C  =>  W'8  where  W  =  [1,2,3]  ,  and 


3+ 2a 

10+ 8a 

21+I8a 

10+8a 

2 

48+50a+6a 

132+153a+27a 

21+18a 

132+lS3a+27a2 

450  +  U285  a 

315  2  2 

-y  a  +  6a 


1? 

ab 

ad 

ab 

b2+c2 

bd+ce 

jid 

bd+ce 

.2  2  * 
d  +e  +f 

where  a,b,d,d,e,f  are  defined  by  comparing  these  two  expressions  for  E 
Decomposition  of  E  gives 


E  =  TT'  where  T  =  — 


a  0  0 
b  c  0 
d  e  f 


then 


T—  1  w  y. 

“  rt 


8 


1_ 

a 

.  -L 

ac 

be-cd 


acf 


1^ 

c 

e 

cf 


and 


T-1,  -  £  1.*. 

B  c  ac 


/a  2a-b 
8  ac 


*  r  ,  say 


3  2e  t  be-cd 
f  cf  acf 


3ac-2ae+be-cd 

acf 


becomes 


L  =  T_1RT 


*  T-1WZ”1W,E~1T 

=  Z^T'W  (T’)" lT_1T  since  Z  is  a  scalar 

=  z’VWcr’)"1 

*  Z“ 1 (T~  XW3  CT*  XW) '  *  Z-1rr’ 


Define  z  to  be  the  normalised  vector  r  ,  that  is,  z  -  r/(Z)A/  .  Then 

l  *  zz*  . 


We  now  construct  P  such  that  PAP'  =  L  . 


Clearly  column  1  of  P  is  vector  z  .  From  the  solution  for  s  =  3 
we  know  column  2  has  components  (b-2a,c,0),  normalized.  Lastly  column  3  is 


(b-2a)G 

[(b-2a)2*c2]f 


We  have  Z  =  W'£~*W  =  W' (T')'^T  *W  =  r'r  ,  so  that  Z  is  scalar.  Matrix  L 


% 

I'K 


$ 

w 

W 

IVJ 


i 

I 

ft 

tol 

Vi1 

V,' 

to 

to, 

MT' 


Tjj 


rj,< 

V 
& 


& 


8 


tt> 


normalized,  with  G  *  be-cd-2ae+3ac  •  The  normalizing  factor  is  m  where 


2  22  22  22^2 
»  *  c  G  ♦(b-2a)Gz  ♦  [(b-2a)*c  ]  V 


*  [c2«-(b-2a)2]G2  ♦  fc2+-(b-2a)2]  f2 

*  [c2+(b-2a)2]{G2  ♦  [c2^(b-2a)2]f2} 


Finally  £  3  P'T  h  .  We  find  £  ,  and  can  then  replace  a  and  B  by 

—  ***  A 

estimates  a  and  8  to  obtain  £  . 

Component  £2  *s  now  the  same  as  £3  of  Section  6.1;  the  new  £3  is 


£3  =  I  [-cG,(b-2a)G,{(b-2a)2>c2}f]  -bf 


0  0 


af  O  h 


be-cd  -ae  ac 


-c2fG  -  bf  (b-2a)G  ♦  [(b-2a)2  ♦  c2]fCbe-cd) 
3  ~  *  af (b-2a)G  -  aef[(b-2a)2  ♦  c2] 

acf[(b-2a)2  +  c2l 


acf  12  (1-kx)  (3a+8) 


1_  #  /a 
m  Bacf 


-acf3Cl+a) (26+9a)  h 
acf2(l*a)(10+3a) 


WBBUaUlWWWgWg^UUJUUMI'Jlia^*^  CT  WW  WWWW  l  UWAVi  '.^yVUTVUVim  ft*  KX  wr*  rv.rv.1  wr-nr^vremi 
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Sa  (1+a) 

*  mB 

with 

m2  *  [c2+(b-2a)2] [G2+{c2+(b-2a)2}f2] 

*  2(10+3a) (a+l)12(l+a)2(2+a) (39+19ct+3a2) 

»  24(l+a)^(10+3cO (2+a) (39«-19a+3a2)  . 


12(8+3a) 

-3(26+9a) 

2(10+3a) 


We  obtain 


24(l+a) (10+3a) (2+a) (39*19a+3a2)B2 


where 


lc  lc  lc 

V  *  12(8>3a)  —  -  3(26+9a)  ♦  2(10+3a)  ^  • 

1  2  3 


~2  "2 
To  obtain  5^  we  replace  a  and  6  by  their  moment  estimators  a  =  k^/l^ 

and  B  a  *  Then 

A  A  2  A  2  A  A  A 

*  nf^4^)  3  Qi  ■*■  ^2  where  is  given  in  Section  6.1,  and 


V*  ...  «,*.«»•  .fa*.' 


.1 


7.  FURTHER  REMARKS 


The  techniques  which  have  been  explored  above  lead  to  goodness-of-fit 
statistics  based  on  sample  moments;  the  test  statistic  can  be  made  dependent 
on  more  moments  by  adding  new  components.  These  components  are  asymptotically 
independent,  and  can  be  expected  to  test  for  different  departures  from  the 
tested  distribution,  analogous  to  components  of  EDF  statistics  (Durbin  and 
Knott,  1972;  Stephens,  1974)  or  those  of  Nfcyman’s  statistic  (Miller  and 

A 

Ouesenberry,  1978).  It  has  been  shown  how  0^  ,  for  small  t  ,  leads  to 
tests  for  normality  and  exponentiality  based  on  already  well-known  statistics, 
and  the  alternatives  for  which  these  are  powerful  are  generally  recognized; 
however,  the  power  of  higher-order  components  to  detect  important  departures 
must  be  investigated.  A  related  question  is  how  many  components  to  take,  since 
the  addition  of  too  many  can  weaken  the  overall  power  against  important 
alternatives  (see  Solomon  and  Stephens  1982  for  remarks  on  Neyman’s  statistic 
which  are  also  relevant  here;  also  Durbin  and  Knott,  1972,  Miller  and 
Quesenberry,  1978) . 

We  note  also  that  the  technique  is  not  unique:  other  functions  of  moments 
can  be  made  linear  in  unknown  parameters  and  will  lead  to  other  test  statistics. 
Interesting  questions  then  remain  on  how  best  to  choose  c,  .  These  questions, 
and  the  practical  questions  of  producing  points  for  finite  n  ,  and  power 
studies,  are  currently  being  investigated. 


w 


I 
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